# English generation
Qwen3 8B GGUF
MIT
ZeroWw is a quantized text generation model that uses f16 format for output and embedding tensors, while other tensors use q5_k or q6_k format, resulting in a smaller size with performance comparable to pure f16.
Large Language Model English
Q
ZeroWw
236
1
Olmo 2 0325 32B Instruct 4bit
Apache-2.0
This is a 4-bit quantized version converted from the allenai/OLMo-2-0325-32B-Instruct model, optimized for the MLX framework and suitable for text generation tasks.
Large Language Model
Transformers English

O
mlx-community
270
10
Mistral Small Instruct 2409 Abliterated
Other
This is an ablated model based on mistralai/Mistral-Small-Instruct-2409, mainly used for text generation tasks.
Large Language Model
Transformers Supports Multiple Languages

M
byroneverson
11.24k
14
GIGABATEMAN 7B GGUF
GIGABATEMAN-7B is a 7B-parameter large language model based on the Mistral architecture, focusing on text generation tasks.
Large Language Model English
G
mradermacher
115
3
Blockchainlabs 7B Merged Test2 4 Prune Sft 4bit DPO Orca
This is a small 7B-parameter LLM optimized for device-side use, pruned and trained with DPO
Large Language Model
Transformers English

B
alnrg2arg
18
2
Cerebras GPT 13B
Apache-2.0
Cerebras-GPT 13B is a large language model trained based on an open architecture and dataset. It belongs to the Cerebras-GPT series and aims to study the scaling laws of large language models and demonstrate the simplicity and scalability of training on the Cerebras software and hardware stack.
Large Language Model
Transformers English

C
cerebras
669
647
Featured Recommended AI Models